16 research outputs found
RecycleGPT: An Autoregressive Language Model with Recyclable Module
Existing large language models have to run K times to generate a sequence of
K tokens. In this paper, we present RecycleGPT, a generative language model
with fast decoding speed by recycling pre-generated model states without
running the whole model in multiple steps. Our approach relies on the
observation that adjacent tokens in a sequence usually have strong correlations
and the next token in a sequence can be reasonably guessed or inferred based on
the preceding ones. Experiments and analysis demonstrate the effectiveness of
our approach in lowering inference latency, achieving up to 1.4x speedup while
preserving high performance.Comment: Technical Repor
Large-Scale Automatic K-Means Clustering for Heterogeneous Many-Core Supercomputer
Funding: UK EPSRC grants âDiscoveryâ EP/P020631/1, âABC: Adaptive Brokerage for the Cloudâ EP/R010528/1.This article presents an automatic k-means clustering solution targeting the Sunway TaihuLight supercomputer. We ïŹrst introduce a multilevel parallel partition approach that not only partitions by dataïŹow and centroid, but also by dimension, which unlocks the potential of the hierarchical parallelism in the heterogeneous many-core processor and the system architecture of the supercomputer. The parallel design is able to process large-scale clustering problems with up to 196,608 dimensions and over 160,000 targeting centroids, while maintaining high performance and high scalability. Furthermore, we propose an automatic hyper-parameter determination process for k-means clustering, by automatically generating and executing the clustering tasks with a set of candidate hyper-parameter, and then determining the optimal hyper-parameter using a proposed evaluation method. The proposed auto-clustering solution can not only achieve high performance and scalability for problems with massive high-dimensional data, but also support clustering without sufïŹcient prior knowledge for the number of targeted clusters, which can potentially increase the scope of k-means algorithm to new application areas.PostprintPeer reviewe
Large-scale hierarchical k-means for heterogeneous many-core supercomputers
Funding: J.Thomson and T.Yu are supported by the EPSRC grants âDiscoveryâ EP/P020631/1, âABC: Adaptive Brokerage for the Cloudâ EP/R010528/1, and EU Horizon 2020 grant Team-Play: âTime, Energy and security Analysis for Multi/Many-core heterogenous PLAtformsâ (ICT-779882, https://teamplay- h2020.eu)This paper presents a novel design and implementation of k-means clustering algorithm targeting the Sunway TaihuLight supercomputer. We introduce a multi-level parallel partition approach that not only partitions by dataflow and centroid, but also by dimension. Our multi-level (nkd) approach unlocks the potential of the hierarchical parallelism in the SW26010 heterogeneous many-core processor and the system architecture of the supercomputer. Our design is able to process large-scale clustering problems with up to 196,608 dimensions and over 160,000 targeting centroids, while maintaining high performance and high scalability, significantly improving the capability of k-means over previous approaches. The evaluation shows our implementation achieves performance of less than 18 seconds per iteration for a large-scale clustering case with 196,608 data dimensions and 2,000 centroids by applying 4,096 nodes (1,064,496 cores) in parallel, making k-means a more feasible solution for complex scenarios.Postprin
Dynamic Analysis of a Large Deployable Space Truss Structure Considering Semi-Rigid Joints
Joints are widely used in large deployable structures but show semi-rigidity due to performance degradation and some nonlinear factors affecting the structureâs dynamic characteristics. This paper investigates the influence of semi-rigid joints on the characteristics of deployable structures in orbit. A virtual connection element of three DOFs is proposed to model the semi-rigid joints. The governing equations of semi-rigid joints are established and integrated into the dynamic equation of the structures. A series of numerical experiments are carried out to validate the proposed modelâs accuracy and efficiency, and the deployable truss structuresâ static and dynamic responses are analyzed. The results show that semi-rigid joints exacerbate the effects of an in-orbit microvibration on the stability of deployable truss structures. Semi-rigid joints lower the dominant frequencies of structures, leading to a âclosely-spaced-frequenciesâ phenomenon and altering the dynamic responses significantly. The effects of semi-rigid joints on deployable truss structures are long-term and can be used to establish a relationship model between structural performance and service life. Nonlinear effects vary with the external load and depend on the structuresâ instantaneous status. These results indicate that semi-rigid joints significantly influence the characteristics of deployable structures, which must be considered in the design and analysis of high-precision in-orbit deployable structures
Electrospun Zein/Polyoxyethylene Core-Sheath Ultrathin Fibers and Their Antibacterial Food Packaging Applications
The purpose of this work is to develop a novel ultrathin fibrous membrane with a core-sheath structure as antibacterial food packaging film. Coaxial electrospinning was exploited to create the core-sheath structure, by which the delivery regulation of the active substance was achieved. Resveratrol (RE) and silver nanoparticles (AgNPs) were loaded into electrospun zein/polyethylene oxide ultrathin fibers to ensure a synergistic antibacterial performance. Under the assessments of a scanning electron microscope and transmission electron microscope, the ultrathin fiber was demonstrated to have a fine linear morphology, smooth surface and obvious core-sheath structure. X-ray diffraction and Fourier transform infrared analyses showed that RE and AgNPs coexisted in the ultrathin fibers and had good compatibility with the polymeric matrices. The water contact angle experiments were conducted to evaluate the hydrophilicity and hygroscopicity of the fibers. In vitro dissolution tests revealed that RE was released in a sustained manner. In the antibacterial experiments against Staphylococcus aureus and Escherichia coli, the diameters of the inhibition zone of the fiber were 8.89 ± 0.09 mm and 7.26 ± 0.10 mm, respectively. Finally, cherry tomatoes were selected as the packaging object and packed with fiber films. In a practical application, the fiber films effectively reduced the bacteria and decreased the quality loss of cherry tomatoes, thereby prolonging the fresh-keeping period of cherry tomatoes to 12 days. Following the protocols reported here, many new food packaging films can be similarly developed in the future
Clinical efficacy and safety of platelet-rich plasma in arthroscopic full-thickness rotator cuff repair: A meta-analysis.
BackgroundArthroscopic repair of rotator cuff tears, although commonly performed, carries the risk of retears. Therefore, bioremediation techniques such as platelet-rich plasma injections have been used as adjuvant therapies. The clinical efficacy of platelet-rich plasma in the arthroscopic repair of full-thickness rotator cuff injury is controversial. We performed a meta-analysis to evaluate the clinical effectiveness and safety of platelet-rich plasma and provide evidence-based medical recommendations for selecting the proper clinical treatment plan for full-thickness rotator cuff injuries.MethodsA search for the terms "platelet-rich plasma" and "rotator cuff" was performed in the PubMed, EMBASE, and Cochrane Library databases using a computer. After conducting quality evaluations and data extraction, RevMan 5.3 software was used to combine the effect sizes, and the GRADEpro Guideline Development Tool was used to rate the level of evidence from aspects of functional score, pain score and retear rate.ResultsEight randomized controlled trials involving 566 patients were included. The long-term retear rate(RR = 0.96, 95% CI [0.52, 1.78], P = .89), Constant score(RR = 0.96, 95% CI [0.52, 1.78], P = .89), and Visual Analog Scale score for pain (SMD = -0.28, 95% CI [-0.60, 0.04], P = .08), as well as both the long-term and short-term Disabilities of the Arm, Shoulder, and Hand scores(SMD = -0.13, 95% CI [-0.44, 0.18], P = .41;SMD = -0.02, 95% CI [-0.40, 0.36], P = .93), were not significantly different between the platelet-rich plasma and control groups. However, the short-term retear rate(RR = 0.29, 95% CI [0.13, 0.65], P = .003) and Visual Analog Scale score (SMD = -0.41, 95% CI [-0.62, -0.19], P = .0002) were significantly lower, while the short-term Constant score(SMD = 0.37, 95% CI [0.19, 0.55], P ConclusionPlatelet-rich plasma injection can effectively improve the short-term outcomes following arthroscopic repair of full-thickness rotator cuff tears, thus reducing the rate of retears, alleviating pain, and improving patients' shoulder function. Specifically, the clinical outcomes are better with the use of platelet-rich plasma in single-row fixation than in other fixation techniques. Therefore, platelet-rich plasma injection can be recommended as an adjuvant therapy in single-row repair for improved short-term results